WORDS AND MORPHISMS WITH STURMIAN ERASURES 



FABIEN DURAND, ADEL GUERZIZ, AND MICHEL KOSKAS 



Abstract. We say x £ {0, 1, 2} N is a word with Sturmian erasures if for any 
a G {0, 1, 2} the word obtained erasing all a in a; is a Sturmian word. A large 
family of such words is given coding trajectories of balls in the game of billiards 
qq ] in the cube. We prove that the monoid of morphisms mapping all words with 

Sturmian erasures to words with Sturmian erasures is not finitely generated. 
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1. Introduction 



In this paper we are interested in words x defined on the alphabet A% = {0, 1, 2} 
having the following property: For any letter a G A%, the word obtained erasing 
all a in x is a Sturmian word. We say x is a word with Sturmian erasures. 
Sturmian words are well-known objects that can be defined in many ways. For 
example, a word is Sturmian if and only if for all n G N the number of distinct 
finite words of length n appearing in x is n + 1 (see [Lo] for complete references 
about Sturmian words). Sturmian words can also be viewed as trajectories of 
balls in the game of billiards in the square. We will see that a large family of 
words with Sturmian erasures is the family of trajectories of balls in the game of 
00 \ billiards in the cube. 

Here we are interested in the morphisms / : A 3 — > A% (the free monoid gener- 
ated by A3) that send all words with Sturmian erasures to words with Sturmian 
erasures. We call such / the morphisms with Sturmian erasures and we denote 
by MSE the set of all these morphisms. Our main result is the following: 

00 ' 

Theorem 1. We have: 

(1) The monoid MSE is not finitely generated; 
^ ■ (2) MSE is the union o/MSE e and the set of permutation of A 3 . 

(3) If f : A3 — > A% is locally with Sturmian erasures such that f(i) is the 
empty word for some j 6 A3 then it is a morphism with Sturmian erasures; 

Where MSE £ is the set of morphisms with Sturmian erasures having the empty 
word as an image of a letter and locally with Sturmian erasures means that there 
exists a word with Sturmian erasures such that f(x) is a word with Sturmian 
erasures. 

We recall that F. Mignosi and P. Seebold proved in |MS| that the monoid of the 
morphisms sending all Sturmian words to Sturmian words is finitely generated. 
In the last section we give some other informations about the words with Stur- 
mian erasures: symbolic complexity, link with the game of billiards in the cube, 
balanced property and palyndroms. 
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2. Definitions, notations and background 

2.1. Words, morphisms and matrices. We call alphabet a finite set of ele- 
ments called letters. Let A be an alphabet and A* be the free monoid generated 
by A. The elements of A* are called words. The neutral element of A*, also called 
the empty word, is denoted by e. We set A + = A* \ {e}. Let u = UqU\ ■ ■ ■ u n _i 
be a word of A*, Ui G A, < i < n — 1. Its length is n and is denoted by \u\. 
In particular, |e| = 0. If a & A then \u\ a denotes the number of occurrences of 
the letter a in the word u. We call infinite words the elements of A N and we set 
A 00 = A N U A*. Let x G A°° and y £ A* . We say that ?/ is a /actor of x if there 
exist u G A* and t> G A°° such that x = uyv . In particular if u — £ then y is a 
prefix of x and if t> = e then ?/ is a suffix of x. An infinite word x = (x n ; n G N) 
of A N is called eventually periodic if there exist two words u G A* and t> G A + 
such that x = uvvv .... 

The complexity function of an infinite word x is the function P x : N — > N where 
P x {n) is the number of factors of length n of x. 

Let A, B and C be three alphabets. A morphism f is a map from A to 5*. It 
induces by concatenation a map from A* to S*. If f(A) is included in B + , it 
induces a map from A N to i? N . All these maps are also written /. 
To a morphism / : A — ► 5* is associated the matrix Mj = (rriij)i£B,jeA where 
rriij is the number of occurrences of i in the word f(j). If g is a morphism from 
B to C* then we can check we have M go f = M g Mf. 

2.2. Sturmian words and Sturmian morphisms. Let A be a finite alphabet. 
An infinite word x G A N is Sturmian if for all n G N, P x {n) — n + 1. Since 
P x (l) = 2, we can suppose A = {0, 1} (see |Loj for more informations about 
these words). 

A morphism / from A to A* is Sturmian if for all Sturmian word x the word f(x) 
is Sturmian. A morphism / is locally Sturmian if there exists at least a Sturmian 
word x such that f(x) is Sturmian. We call St the semigroup generated by the 
morphisms E, <p, and (p defined by 

E : A* — ► A* ip : A* — ► A* (p : A* — ► A* 

i — ► 1 i — ► 01 i — ► 10 

1 i — > 1 i — > 1 i — ► 

Theorem 2. |BSl IMSj The following three conditions are equivalent 

(1) / e St; 

(2) / locally Sturmian; 

(3) / Sturmian. 

2.3. Words with Sturmian erasures. Let A% = {0, 1, 2} and let x be a infinite 
word of A3. For i G A3 we denote 7Tj : A3 — > A3 the morphism defined by 
7Ti(j) = j if j G A 3 with j ^ i and 7^(1) = e. 

Definition 3. An infinite word x G A3 is called word with Sturmian erasures 
if and only if the word 7Ci(x) is a Sturmian word for all i G A3. We say 
f : A 3 — > A3 is a morphism with Sturmian erasures if f(x) is a word with 
Sturmian erasures for all words x G A3 with Sturmian erasures. 

We call WSE the set of words with Sturmian erasures and MSE the set of mor- 
phisms with Sturmian erasures. We remark MSE is a monoid for the composition 
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law of morphism. The image of a Sturmian word by a morphism with Sturmian 
erasures is a word with Sturmian erasures. Hence WSE is not empty. 

Example 1. Let g : A 3 — > A3 be the morphism defined by : g(0) = 02, g(l) = 10 
and 0(2) = e. Let F = and for n > F n+1 = <p{F n ). Let F G {0, 1} N be 
the unique fixed point of (p in {0, 1} N (see Qu| ). Then for each n > F n is a 



prefix of F, and we have F = 0100101001001.... This word is called the Fibonacci 
word (remark that |-F n +2| = l-^n+il + \F n \, n > 0). It is a Sturmian word. From 
Theorem [2] we deduce that 

g(F) = 0210020210021002021002021002100202100210. . . 

is a word with Sturmian erasures. Hence WSE is not empty. 

Let x G Af be a word with Sturmian erasures. The word tt 2 (x) is a Sturmian 

word and g o 7r 2 = g. Moreover 7r 2 9\{o,i\ is a Sturmian morphism. Hence 

7r 2 o g(x) = n 2 ogo tt 2 (x) = tc 2 o 5f| {0 ,i}(vr 2 (x)) = </|{o,i}(7r 2 (a:)) 

is a Sturmian word. We can also show that tiq og{x) and 7Ti og(x) are words with 
Sturmian erasures. Hence, g is a morphism with Sturmian erasures and MSE is 
not empty. 

3. Proofs of points (JS]) et of Theorem rjj 

We denote by MSE e the set of morphisms of MSE such that there exists I G A3 
with f(l) = e. We will prove that MSE is the union of MSE £ with the set of 
permutations on A3. This last set is generated by 

E : A3 — ► A* E x : A* — ► A* E 2 : A* — ► A* 

1 — >0 1 — > 2 1 — el 

1 1 — > 2 1 1 — > 1 1 1 — ► 

2 1 — ► 1 2 1 — ^0 2 1 — > 2 

We denote by MSEj, z G A3, the set of morphisms with Sturmian erasures such 
that f(i) = e. We have MSE £ = MSE U MSEi U MSE 2 . 

3.1. Proof of the point ([3]) of Theorem [D We start with the following 
proposition. 

Proposition 4. If a morphism f : {0, 1} — > A3 maps a Sturmian word defined 
on {0, 1} N into a word with Sturmian erasures then it maps any Sturmian word 
into a word with Sturmian erasures. 

Proof. Let x G {0, 1} N be a Sturmian word and / : {0, 1} — > A3 be a morphism 
such that f(x) is a word with Sturmian erasures. 

Let i be a letter of A 3 and E : A 3 — > {0, 1}* be a morphism such that 
{0,1} = {E(a);i 7^ a, a G A3}. Then E o m o f(x) is Sturmian and 
E o o f : {0, 1} — > {0, 1}* is a locally Sturmian morphism. Hence, from 
Theorem [2] it is Sturmian. It follows that for every Sturmian word y, f(y) is a 
word with Sturmian erasures. This ends the proof. □ 

Now we prove the point (j3J) of Theorem [Q Let / : A 3 — > A3, such that f(i) = e 
for some i G A 3 , and x G A3 be a word with Sturmian erasures such that f(x) is 
a word with Sturmian erasures. We remark that we have / o 7r, ; (x) = f(x) and 
that iti(x) is a Sturmian word. 
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We can suppose A 3 \ {i} = {0, 1}. Hence the morphism / o tt^ |{o,i> satisfy the 
hypothesis of Proposition HI Consequently if y is a word with Sturmian erasures 
then f (y) = f o 7Ti(y) = f o 7Ti[{o u(7Tj(y)) is a word with Sturmian erasures. □ 

Example 2. We can remark that there exist morphisms / : A 3 — > such that 
for some word x G WSE we have f(x) G WSE but / is not a morphism with 
Sturmian erasures. 

For example let F be the Fibonacci word, / be defined by /(0) = 0, /(l) = 1 

and /(2) = 012, g : A 3 -> A* be defined by g(0) = 01, #(1) = 02 and g(2) = e, 

and, h : A 3 -> A* be defined by /i(0) = 02, = 10 and fc(2) = e. 

As in Example 1 we can prove that g, h and fog are morphisms with Sturmian 

erasures and consequently x = g(F) and f(x) = fog(F) are words with Sturmian 

erasures. 

But we remark that foh(F) is not a word with Sturmian erasures. Indeed 001210 
is a prefix of f(y) and 00110 is a prefix of w — TT 2 f(y). Consequently P w (2) = 4 
and w is not a Sturmian word. 

3.2. Proof of the point ([2]) of Theorem [H We need the following lemma 
that follows from Theorem [2] and the fact that the determinant of the matrices 
associated to ip, <p and E belong to {—1, 1}. 

Lemma 5. Let M be the matrix associated to the Sturmian morphism f . Then 
detM = ±1. 

Let us prove the point (J2]) of Theorem [IJ This proof is due to D. Bernardi. 
Let / be a morphism of MSE. Let % G A 3 and E : A 3 \ {i} -> {0, 1}* be a 
morphism such that {0, 1} = {E(a); a ^ i, a G A 3 }. We set h = Eon i o fog^ 01 -^ 
where g : A 3 — > A3 is the morphism defined by: p(0) = 02, p(l) = 12 and 

0(2) = e. 

As the letter i does not appear in the images of 7Tj o /, we consider i\i o / as a 
morphism from A 3 to (A 3 \ {z})*. We set M n . j = (u,v,w) where u, v and w 
are column vectors belonging to 1R 2 . We recall g is a morphism with Sturmian 
erasures (see Example 1 of the subsection [23]). Hence the morphism h is Sturmian 
and we have Mh = (u + w,v + w). From Lemma [5] 

det(w, v) + det(u, w) + det(w, v) = det(w + w, v + w) = ±1. 

We do the same with g being one of the two following morphisms : (0 1 — > 01, 
1 1 — > 12, 2 i-> e) and (0 1 — ^ 02, 1 1 — > 01, 2 e). We obtain finally 

(1) det(it, v) + det(w, w) + det(v, w) — ±1. 

(2) det(tt, f ) + det(u>, -u) + det(w, v) = ±1. 

(3) det(it, u) + det(w, io) + det(w, f ) = ±1. 

The combinations of the equations (1) and (2), (2) and (3), and, (3) and (1) 
imply respectively that det(u, v), det (w,u) and det(w,v) belong to { — 1,0,1}. 
From (1), one of three determinants det(u,v), det(«, w) or det(v,w) is different 
from 0. 

We suppose det(u,v) 7^ (the other cases can be treated in the same way). The 
set {u,v} is a base of R 2 , hence there exist two real numbers a and b such that 
w = au + bv. We have a = det(w,v)/ det(u, v) and b = det(w,u)/ det(v,u). 
Moreover from (1) and (2) we see that det(u,w) + det(v,w) and — (det(u, w) + 



WORDS AND MORPHISMS WITH STURMIAN ERASURES 



5 



det(v , w)) = det(w, u) + det(u>, v) belong to {det(t>, u) — 1, det(t>, u) + 1} which is 
equal to {0,2} or {—2,0}. Consequently det(u, w) + det(v,w) = 0. Hence a = b 
and w = a(u + v). The vector w is the column of the matrix of a morphism 
therefore it has non-negative coordinates which implies that a is non-negative. 
Suppose a > 0. Then det(w, v) and det(u, v) are positive and have the same sign. 
Hence one of the equations (2) or (3) is equal to -3 or 3 which is not possible. 
Consequently a = and w — 0. 

Therefore for all i G A 3 the matrix M n . f = (rrii(c, d)) ce A 3 \{i},deA 3 has a column 
(toj(c, di)) ce A 3 \{i} with entries equal to 0. 
Two cases occurs. 

1- There exists i,j G A 3 (i ^ j) such that di = dj. In this case we easily check 
that f(di) = e. Consequently / belongs to MSE £ . 

2- The sets {^0,^1,^2} and A 3 are equal. In this case we can check that / is a 
permutation. □ 

4. Prime morphisms 

4.1. Some technical definitions. Let A be an alphabet and / : A — > A* be 
a morphism. A letter a is called f -nilpotent if there exists an integer n such 
that f n (a) = e (if it is not ambiguous we will say it is nilpotent). The set of 
/-nilpotent letters is denoted by Aff. We call Vf the set of letters a such that 
there exists an integer n satisfying 7Tj\f f (f n (a)) = a where TTj\f f (b) = e if b e Aff 
and b otherwise. The set of such letters is denoted by Vj. 

We say the letter a is f -permuting if there exists an integer n such that 
f n (a) E (A/> U Vj)* \ Aff*. We denote by V f the set of such letters. We re- 
mark that Vj is included in Vf. 

A letter a is called f -expansive, or expansive when the context is clear, if it is 

neither nilpotent nor permuting. We remark the letter a G A is /-expansive 

if and only if lim^+oo |/ n (a)| = +00 and it is /-permuting if and only if the 

sequence (|/ n (a)|;n G N) is bounded and is never equal to 0. 

The morphism / is nilpotent if f(A) is included in Aff, i.e., if there exists an 

integer n such that f n (a) = e for all a G A. A morphism / is called expansive if 

there exists a /-expansive letter. A morphism / is a unit if it is neither nilpotent 

nor expansive. In others words if f(A) is included in (Aff U Vf)*. 

Let AA. be a monoid of morphisms. A morphism / G AA is said to be prime in 

AA if for any morphisms g and h in AA such that f — g o h, then g or /i is a 

unit of A4. We say that / is of degree n in AA, n G N, if any decomposition of 

/ into a product of prime and unit morphisms of AA contains at least n prime 

morphisms and there exists at least one decomposition of / into prime and unit 

morphisms of AA containing exactly n prime morphisms. 

The set of prime morphisms in St is {/ o g o h; f, h G {Id, E}, g G {</?, 

4.2. Some conditions to be a prime morphism. In the sequel we need the 
following morphisms which are extensions to the alphabet A 3 of the morphisms 

if and tp: 

A* 3 — > A* 3 , fp{ : A3 — > Al 

1 — > 01 1 — > 10 

1 1 — > 1 1 — > 

2 1 — > e 2 1 — > e . 
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In the next proposition we need the following lemma. 

Lemma 6. Let g G MSE 2 . Then, \g(a)\ > 2, \g(a)\o + \g(a)\i > 1 and 
\g{01)\ a > 1 for all a G {0,1}. 

Proof. Suppose for a G {0, 1} we have \g(a)\ = 1, for example g>(a) = 6. Then 
7Tfe o g{x) is periodic for all x G A3 . This contradicts the fact that g belongs to 
MSE 2 . If \g{a)\ = we have the same conclusion. This proves the first part of 
the lemma. 

Suppose |g(a)| + = 0. Then 7r 2 o g(x) is periodic for all x G A3. This 

proves the second inequality. 

Suppose |gr(01)| o = 0, then a does not appear in g(x). This contradicts the fact 
that g belongs to MSE 2 . □ 

Proposition 7. Let i G A 3 and f G MSEj. We set A 3 = {i,j, k}. 

1) If f(j) is neither a prefix nor a suffix of f(k) and that f(k) is neither a prefix 
nor a suffix of f(j), then f is prime in MSEj. 

2) Moreover, if f is prime in MSEj and if we have |/(012)|j > |/(012)| fe > 
|/(012) \i, then f(j) is neither a prefix nor a suffix of f(k) and f{k) is neither a 
prefix nor a suffix of f(j). 

Proof. We only make the proof in the case i = 2. 

1) We suppose /(0) is neither prefix nor suffix of /(l) and that /(l) is neither 
prefix nor suffix of /(0). We proceed by contradiction, i.e., we suppose there 
exist g,h G MSE 2 which are not units such that / = g o h. 

Let hi = 7r 2 o h. We have g o h — g o h\. We define hi : {0, 1} — > {0, 1}* by 
hi(i) = hi(i) for alH G {0, 1}. We remark that hi is a Sturmian morphism hence 
it is a product of ip,ip and E (Theorem [2]). Therefore hi is a product of fi,<fi 
and 7r 2 o E 2 . 
We consider two cases. 

Suppose hi G" {n 2 o E 2 , 7r 2 o E 2 o E 2 }. Then, for example, hi is equal to h 2 o <p x 
where h 2 is a product of ipi, <fi and tt 2 o E 2 . The other cases (hi = h 2 o ^ or 
hi = h 2 o o 7r 2 o i5 2 or hi = h 2 o Tp{ o 7r 2 o £J 2 ) can be treated in the same way. 
We have /(0) = g o h 2 o (fi(0) —go h 2 (0)g o h 2 (l) and /(l) = g o /i 2 (0). which 
contradicts the hypothesis. 

Suppose hi G {n 2 o E 2 , n 2 o E 2 o i? 2 }, then /ii(0) = 1, ^i(l) = and hi(2) = e or 
/ii(0) = 0, /ii(l) = 1 and hi(2) = e. In both case we easily check that h is a unit 
of MSE 2 . This ends the first part of the proof. 

2) We now suppose / is a prime morphism in MSE 2 such that |/(01)|j > 
|/(01)| fc > |/(01)| 2 , where A 3 = {j,k,2}. 

We proceed by contradiction. We suppose /(l) is a prefix of /(0). The other case 
can be treated in the same way. There exist u and v in A3 such that /(0) = uv 
and /(l) = m. We define /i : A 3 — >• A3 by g(0) = u, g(i) = v, g(2) = e, 
h(0) = 01, h(l) = 02 and h(2) = e. We remark that h is not a unit and f — goh. 
To end the proof it suffices to show that g is not a unit of MSE 2 . We start 
proving g belongs to MSE 2 . 

Let x G WSE. As in Example 1 we can prove that h belongs to MSE 2 . 
Consequently h(x) belongs to WSE. Moreover f(x) = g(h(x)) belongs to WSE. 
From the point ([3]) of Theorem [1] it comes that g belongs to MSE 2 . From 
Lemmalwe have |/(01)| = |#(010)| > 6. Consequently |/(01)| + |/(01)|i + 
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1/(01) I2 > 6. From the hypothesis it comes that |g(010)|j = |/(01)|j > 3. Hence 

2^(01)1,- - 1^(1)1,- > 3 and > 2. 

Now we prove by induction that for all n G N we have 

|vr 2 o £ n (01)| > n + 2, |tt 2 o # n (0)| > 1, and |tt 2 o g n (l)\ > 1. 

This is true for n — 0. We suppose it is true for nGN. From Lemma [6] we have 

|7T 2 o<f +1 (01)| = |tt 2 o^(01))| > \n 2 og n (jjk)\ = |7r 2 o^(01)| + |7r 2 o^(j)| > n+3. 

Moreover, from Lemma [6] in g(j) occurs a letter a G {0, 1}. Consequently, 

\7r2 g n+1 (j)\ = W20 g n {g{j))\ > \^og n (a)\ > 1. 

We proceed in the same way for the letter k. This concludes the induction. 
Therefore, it is clear g is expansive. This concludes the proof. □ 

5. The monoid MSE is not finitely generated 

5.1. Some preliminary results. To prove the point (jTJ of Theorem [T] we need 
the following subset of MSE. Let MSE be the set of morphisms / G MSE 2 such 
that for some nGN 

tt 2 o / G F n , txi o / G G n and n o f e H„ where F n = {(pt, (pl} n , 

G n = Eqo {</?!, ^1} o E 2 o {</?!, ^I} 71 " 1 and H n = E 2 oE o {<^ 1; ^T} 71-1 . 

With the two following lemmata we prove that MSE is not empty. Before we 
need a new definition and we make some remarks. 

Let u G {0, 1}*, v G {0, 2}* and 10 G {1, 2}* be three words. We say that u, v and 
w intercalate between them if and only if there exists x G A^ such that 7r 2 (x) = u, 
tti(x) = v and n (x) = w. 

Let (u„)neN be the Fibonacci word : w n+ i = w n + for all n > 1, uq = and 
Ui = 1. We can remark that for all n > 1 we have 

w n 
u n u n -i . 


Lemma 8. Let n > 2, f G -F n; (7 G G n and ft, G H n . Then, for all a G {0, 1} we 
have \f(a)\ = \g(a)\ , |/(a)|i = |/i(a)| a and |p(a)| 2 = |/i(a)| 2 . 

Proof. It suffices to remark that Mf = M^, 





U n +1 


U n 
















M g = 











and Mh = 




U n -1 







U n -1 


U n -2 









U n -2 






□ 

Lemma 9. Let /, g and h be three morphisms from A3 to A? 3 such that f(a),g(a) 
and h(a) are respectively words on the alphabets {0, 1} ; {0,2} and {1,2} for all 
a G A3. Then, f(a), g(a) and h(a) intercalate between them for all a G A3 if and 
only if there exists a morphism ip : A3 — > A^ such that 7i~ 2 o ip = f , m o i/j — g 
and tt o ip = h. 

Proof. For all a G A3 let ip(a) be the word obtained intercalating /(a), g(a) and 
h(a). This defines a morphism ip : A3 — > A3. We can check it satisfies 7r 2 o-0 = /, 
7Ti o ^ = g and 7i"o o -0 = h. The reciprocal is left to the reader. □ 
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Lemma 10. For all n G N* ; (pi(l) is a prefix but not a suffix o/(/?"(0). And for 
all n G N*\{1} if g = E o Tp\ o E 2 o ^i™ 1 then g(l) is a suffix but not a prefix 
ofg(0). 

Proof. Let n G N*. We have pj(0) = ^ _1 (01) = ^(l)^"^). Hence $J(1) is 
a prefix of <fi(0). 

We proceed by induction to prove that <£>i(l) is not suffix of (pi(0). For n = 1 is 
it clear. Suppose it is true for n G N*. We prove it is also true for n + 1. 
We have ^ +1 (0) = ^ +1 (1)^(1) and ^ +1 (1) = p?(0). Suppose <^ +1 (l) is a 
suffix of < +1 (0). Looking at M v n we remark that |<^"(1)| < |^i(0)|, therefore 
<Pi(l) is a suffix of <fi(0) which contradicts the hypothesis. This concludes the 
first part of the proof. The other part can be achieved in the same way. □ 

Lemma 11. Let n G N* ; /„ = <p™, g n = E o Tp[ o E 2 o (pi n ~ and h n = E 2 ° 
Eq o cpi n ~ . Then there exists a morphism if> n G MSE 2 such that ir 2 ° if> n — fn, 
71*1 ° ipn = g n and 7r o if> n = h n . 

Proof. We easily check that if if) is a morphism such that 7r 2 oif) = / n , m oif> = g n 

and 7r o if) = h n , for some neN, then if) belongs to MSE 2 . 

We proceed by induction on n to prove what remains. For n = 1, we have 

A : A* 3 — A3, ^ : A| — > A3, : A3 — ► A*, Vi 

1 — >01 1 — >0 1 — >1 

1 1 — ► 1 1 — >20 1 1 — >2 

2 1 — > e 2 1 — > e 2 1 — 



The morphism ^1 is such that 1^2 if>i = fi, i>i = g 1 and 7r 
consequently if)\ belongs to MSE2. For n = 2, we have 



o 



A* — > 


A* 


1 — ► 


01 




20 


2 1 — > 


e. 


if>i = hi 


, and 



f 2 : A* — ^A*, ^ 2 : A* — > A*, /i 2 : A* — A*, : A* —.A* 
1 — >010 1 — >200 1 — >21 1 — >2010 



Al g 2 : 


A* 


010 





01 


1 


£ 


2 



1 1 — >01 1 1 — >0 1 1 — >1 1 1 — >01 

2 1 — 2 1 — 2 1 — 2 1 — >e. 

The morphism ?/> 2 is such that tt 2 o ?/> 2 = / 2 , 7ti o ip 2 = g 2 and 7r o ^ 2 = /i 2 > an d 
consequently ?/> 2 belongs to MSE 2 . Now we suppose the result is true for n — 1 
and n > 2. We prove it is also true for n + 1. We have 



/ n+1 (0) = / n _!(0)/ n _i(l)/ n _i(0), / n+ i(l) = / n (0), / n+ i(2) = e, 

5-n+l(0) = flf n _i(0)p n _i(l)^ n _i(0), flfn+l(l) = fl'n(O), 0n+l( 2 ) = £ ' 

^n+i(0) = /i n _i(0)/i n _i(l)/i n _i(0), h n+ i(l) = h„,(0) and /i„+i(2) = e. 

From the induction hypothesis and Lemma M we know fi(a), gi (a), /ij(a) in- 
tercalate between them for all a G A 3 and all i G {n — Consequently, 
using Lemma [HI there is a morphism if> : A 3 — > A3 such that 7r 2 o ^ n+ i = / n +i, 
7Ti o if> n+1 = g n+1 and ir o ^ n+1 = fo n+1 . □ 

Proposition 12. For all n G N*, the morphism if) n defined in LemmaUTlis prime 
znMSE 2 . 

Proof. We keep the notations of Lemma [TT1 Let n G N*. From Proposition [7] it 
suffices to prove that V'n(l) is neither a prefix nor a suffix of if) n (0). We proceed 
by contradiction: We suppose if> n (l) is a prefix or a suffix of if> n (0). 
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Suppose that ipniX) is a prefix of if) n (0). Then 7Ti o if) n (l) is a prefix of 7Ti o ip n (0) 
and consequently g n (l) is a prefix of p n (0). This contradicts Lemma [TU1 
Suppose that V'n(l) is a suffix of Vv(0). Then 7T20^ n (l) is a suffix of vr 2 o^ n (0) and 
consequently / n (l) is a suffix of / n (0). This contradicts Lemma [TUl and proves 
the lemma. □ 

Corollary 13. The set MSE2 contains infinitely many primes. 

Proof. We left as an exercise to prove that for all n G N* we have ip n 7^ ipn+i, 
where ip n is defined in Lemma [TTJ Proposition [12] ends the proof. □ 

5.2. Proof of the point (CQ) of Theorem [H We proceed by contradiction: We 
suppose there exists F = {/1, ...,//} C MSE generating MSE, i.e., all g G MSE 
is a composition of elements belonging to F. 

Let AT — swp a £A3,i<i<l \ fi(. a )\i (V'n)neN be the morphisms defined in Lemma 1111 
and (w n )„ e N be the Fibonacci word defined in the previous section. We remark 

lim max |?/> n (a)| > lim u n+ \ = +00. 

We fix n G N such that max ag A 3 IV'fi(a) I > By hypothesis there exist <?i, • • • , <?& 
in F such that ip n = gi°- ■ -°gk- We set h = g 2 o- ■ -og k . The morphism ^ n belongs 
to MSE a for some a G A3. It implies VVi(2) = ip n {a) — £ and consequently a = 2. 
There exists 6 G A 3 such that #i G MSE;,. We remark %j) n = g\ o h = g\ o ^ h o h. 
Two cases occurs. 

First case: For all a G {0, 1} we have | tt^ o h(a)\ = 1. 

The morphism h being a morphism with Sturmian erasures we cannot have 
7Tb o h(0) = 7Tb o h(l). Consequently ip n = g\ o E 2 or ip n = g\. This implies 
there exists a G A3 such that |<7i(a)| > N which is not possible. 

Second case: There exists a G {0, 1} such that 1 o h(a)\ > 1. 
We remark 'n^oh = -n^oho -n 2 

If 6 = 2 then 7Tf, o fy{o,i} : {0, 1} — > {0, 1}* C A3 is a Sturmian morphism different 
from E and Idm^y. Hence from a remark we make in Subsection 13.21 there exist 
% and j in {0, 1}, % 7^ j, such that the word 71^ o /ii{o,i}(«) is a prefix or a suffix of 
^fe fy{o,i}0)- Hence is a prefix or a suffix of ip n {j)- Proposition [7] implies 

ip n is not prime in MSE 2 which contradicts Proposition [T2J 
Let b ^ 2. We set {6, c} = {0, 1}. Then, E c on b o h\ {0 , 1} : {0, 1} -»■ {0, 1}* C A* 
is a Sturmian morphism different from F and Jd{o,i}. Hence from a remark we 
make in Subsection 13.21 there exist i and j in {0, 1}, i ^ j, such that the word 
E c o o fy{o,i}(*) is a prefix or a suffix of E c o K b o h\{Q tl \(j). Hence Vn(*) is a 
prefix or a suffix of ip n {j)- Proposition [7] implies ip n is not prime in MSE2 which 
contradicts Proposition [T2J 
This concludes the proof. 

6. Some further facts about words with Sturmian erasures 

6.1. Geometrical remarks. We recall that a Sturmian word can be viewed as 
a coding of a straight half line in M 2 with direction (1, a) where a is a positive 
irrational number, or in other terms as a trajectory of a ball in the game of 
billiards in the square with elastic reflexion on the boundary. We do not give the 
details here, we refer the reader to |Loj . 
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Let us extend the construction given in |Loj to obtain what is usually called 
billiard words in the unit cube [0, l] 3 . Let d = (d ,di,d 2 ) G [0,+oo[ 3 and 
p = (po,pi,p 2 ) G [0, 1[ 3 . Let D be the half line with direction d and intercept 
p that is to say D = {td + p;t > 0}. Consider the intersections of D with the 
planes x = a, y = a, z = a, a G Z: We denote by Io,Ii, ■ ■ ■ these consecutive 
intersection points . We say I n crosses the face F i: i G {0, 1,2}, if the i + 1-th 
coordinate of is an integer and the % + 1-th coordinate of I n+ \ — I n is not equal 
to 0. 

We set Q n = {i G {0, 1, 2}; I n crosses Fj}. Let x = u Ui ... be a word such that 



We say x is a billiard word in the unit cube [0, l] 3 (with direction d and intercept 
p). We can also say that x is a coding of D. Of course a half line can have several 
codings. One of the codings of a half line is periodic if and only if d G 7Z 3 for 
some 7 G R + . 

When one of the coordinates of the direction is equal to zero and the two others 
are rationally independent we can easily deduce from [LoJ (Chapter 2) that x is a 
Sturmian word. The reciprocal is also true: All Sturmian words can be obtained 
in this way (see [LoJ). 

We remark that if x is a non-periodic cubic billiard word then 710(2) is a Stur- 
mian word with direction d = (0,^1,6/2) and intercept p = (0,pi,p2) (i-e. the 
orthogonal projection of D onto {0} x [0, +00 [x [0, +oo[). We have the analogous 
remark for iri(x) and 772(2). It is easy to conclude that a cubic billiard word is a 
word with Sturmian erasures if and only if it is non periodic and d g]0, +00 [ 3 . 
There exist words with Sturmian erasures that are not cubic billiard words. For 
example, take the Fibonacci word x (Example 1) and the morphism ip defined 
by ^(0) = 0012 and ^(1) = 01. It is easy to see that y = ip(x) is a word with 
Sturmian erasures. Let us show that if y was a cubic billiard word then the word 
102 should appear in x, which is not the case. 

We briefly sketch the proof. Suppose y is a cubic billiard word with direction 
d = (l,a,/3) and intercept p, then the words 7r (y) and 7T 2 (y) are Sturmian 
words with respective directions (0, a,/3) and (l,a, 0). It can be shown that 
a = 9 — 1 and (3 = (9 — l) 2 where 9 = (y5 + l)/2. But with such a direction 
d — (1, 9 — 1, {9 — l) 2 ) easy calculus show that the word 102 should appear in y. 

6.2. Balanced words. Let us recall a characterization of Sturmian words due 
to Hedlund and Morse [HM2J. Let A be a finite alphabet. We say a word 
x G A N is balanced if for all factors u and v of x having the same length we 
have ||u| a — \v\ a \ < 1 for all a G A. Suppose CardA = 2. A word x G A N 
is Sturmian if and only if x is non eventually periodic and balanced. P. Hubert 
characterizes in [HuJ the words on a three letters alphabet that are balanced. This 
characterization shows that such words are not words with Sturmian erasures. 

Definition 14. Let A be a finite alphabet. We say x G A m is n-balanced if n is 
the least integer such that: For all words u and v appearing in x and having the 
same length we have ||w|a — | f |a|^ ^ f or all a E A. 

Clearly, Sturmian words are 1-balanced. 




i if fl n = {i}, 
ij if Q n = {i,j} where i 7^ j, 
13k ifQ n = {t,j,k} = {0,1,2}. 
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Proposition 15. If x G A% is a word with Sturmian erasures then x is non 
eventually periodic and 2-balanced. 

Proof. It is clear x is non eventually periodic. From a previous remark we know 
that x is not 1-balanced. 

Suppose x is n-balanced with n > 3: There exist e G A 3 and two words u and v 
appearing in x and having the same length such that ||it| e — \v\ e \ > 3. 
For all a G A% we set n(a) = \\u\ a — \v\ a \. Then we can set A3 = {a,b,c} 
where n(a) > 3 and n(a) > n(b) > 72(c). Without loss of generality we sup- 
pose \u\ a — \v\ a = n(a). As \u\ = \v\ we have n(a) = (\v\b — \u\b) + (\v\ c — 
\u\ c ). Consequently we necessarily have \v\b — \u\b > and \v\ c — \u\ c > be- 
cause n(a) > n(b) > n(c). Thus n(b) = \v\b — \u\b, n(c) = \v\ c — \u\ c and 
n(a) = n(b) + n(c). We also see that n{b) > 2 and n(c) > 0. 
Suppose there exists a factor u' of the word u verifying |7r c (u')| = |7r c (t>)| and 
Kc( w ')la — l 7r c(' u )|a > 2. Then this would say that vr c (x) is not balanced and a 
fortiori not Sturmian which would end the proof. 

Let us find such a v! . We have |vr c (u)| > |vr c (f)| > \v\b > 2. Hence there exists 
a non-empty word u' satisfying |7r c (V)| = |7r c (t>)| and having an occurrence in u. 
Moreover 

kcOOL + \ir c {u% = \ir c {u')\ = \n c (v)\ = \v\ - \v\ c 

= \u\ - \v\ c = \u\ a + \u\ b + \u\ c - \v\ c = \7T c (u)\ a + |7T c (w)| 6 - n(c). 

Hence 

\-K c {u)\ a - n(c) = |7T c («')|o + \^c(u')\b ~ W{u)\b < |7r c (u')|o 

and then 

2 < n(b) = n(a) - n(c) = \ic c (u)\ a - |7r c (u)| a - n(c) < |7r c (u')|o - W{v)\ a , 
which ends the proof. □ 

6.3. Complexity. Let 1 be a word with Sturmian erasures and / be a mor- 
phism belonging to MSE, for some i 6 A3. Then 7Ti(x) is a Sturmian word and 
f(x) = f(iTi(x)). Consequently from a result of Coven and Hedlund [CHJ we 
deduce there exist two integers and k such that P x (n) = n + k for all n > uq. 
For example, let F be the Fibonacci word and / : {0, 1, 2} — ► {0, 1, 2}* be the 
morphism defined by /(0) = 0102, /(l) = 01 and /(2) = e. It is a morphism 
with Sturmian erasures and y = f(F) is a word with Sturmian erasures. In fact 
it is a cubic billiard word with direction d — (1,9 — 1, (6 — l) 2 ) and intercept 
p = (0,9 — 1,(9 — l) 2 ), where 9 is the golden mean (y/b + l)/2. 

This does not contradict the result in |AMSTj saying that if 1, a and (3 are ratio- 
nally independent then the complexity of the cubic billiard word with direction 
(1, a, (3) and intercept p g]0, 1[ 3 is n 2 + n + 1, because — 1 + (a — 1) + (a — l) 2 = 0. 

6.4. Conclusion. Many generalizations of the Sturmian words were tried (more 
letters, applications of 1? to {0, 1}, ...) but none appeared to be entirely suitable 
in the sense that it seems impossible to extend these properties to a more general 
domain astonishing varieties of the properties characterizing these words. The 
example which we chose for this paper, does not derogate from this rule. Never- 
theless, the fact that MSE is not given by a finite generator shows a fundamental 
difference between the Sturmian words and any generalization with more than 
two letters because the definition adopted here was less "compromising" possible. 
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Furthermore, this definition gives a words of a complexity structurally similar to 
the one of the Sturmian words. 
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